Toward Improved and Transparent Imputation Techniques for Online Traffic Data Streams and Archiving Applications

نویسندگان

  • Rafael J. Fernández-Moctezuma
  • Robert L. Bertini
  • David Maier
  • Kristin A. Tufte
چکیده

A diverse range of measurements collected from the transportation infrastructure facilitate day to day operation, surveillance, forecasting, and dissemination of current condition information to the general public. The mechanisms for assessing current system conditions rely on multiple sensor types and mobile probes. The quality and completeness of traffic data is generally regarded as suboptimal. Several techniques are used in the transportation industry to cope with incomplete or suspiciously erroneous data, in particular the imputation of missing values in a range of types of traffic databases, streams and archives. The objective of this paper is to identify and categorize common imputation techniques reported in the transportation literature. This review will be discussed and presented in the context of a notional system that performs online imputation for traffic data streams. Exemplar uses of traffic data include traveler information systems and traffic management applications.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Classification of encrypted traffic for applications based on statistical features

Traffic classification plays an important role in many aspects of network management such as identifying type of the transferred data, detection of malware applications, applying policies to restrict network accesses and so on. Basic methods in this field were using some obvious traffic features like port number and protocol type to classify the traffic type. However, recent changes in applicat...

متن کامل

High-Speed Distributed Data Handling for High-Energy and Nuclear Physics1

The advent (and promise) of shared, widely available, high-speed networks provides the potential for new approaches to the collection, organization, storage, and analysis of high-speed and high-volume data streams from on-line instruments. Such data streams originate from many types of on-line instruments and imaging systems, and are a “staple” of modern scientific, health care, and intelligenc...

متن کامل

Missing data imputation in multivariable time series data

Multivariate time series data are found in a variety of fields such as bioinformatics, biology, genetics, astronomy, geography and finance. Many time series datasets contain missing data. Multivariate time series missing data imputation is a challenging topic and needs to be carefully considered before learning or predicting time series. Frequent researches have been done on the use of diffe...

متن کامل

Toward Comprehensive Traffic Generation for Online IDS Evaluation

We describe a traffic generation framework for conducting online evaluations of network intrusion detection systems over a wide range of realistic conditions. The framework integrates both benign and malicious traffic, enabling generation of IP packet streams with diverse characteristics from the perspective of (i) packet content (both header and payload), (ii) packet mix (order of packets in s...

متن کامل

Development of an Imputation Strategy for the Portal Adus

The PORTAL ADUS stores traffic data reported by sensor station deployed along the freeway system of the Portland, Oregon metropolitan area. In addition to data retrieval interfaces, PORTAL provides end user applications such as congestion maps. The utility of end user applications is subject to the availability of data. Traffic data is generally known to suffer from a variety of problems, inclu...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008